Spectral modification for concatenative speech synthesis

Authors

Johan Wouters

Michael W. Macon

Abstract

Concatenative synthesis can produce high-quality speech but is limited to the allophonic variations and voice types that were captured in the database. It would be desirable to modify speech units to remove formant discontinuities and to create new speaking styles, such as hypo- or hyper-articulated speech. Unfortunately, manipulating the spectral structure often leads to degraded speech quality. We investigate two speech modification strategies, one based on inverse filtering and the other on sinusoidal modeling, and we explain their merits and shortcomings for changing the spectral envelope in speech. We then propose a method which uses sinusoidal modeling and represents the complex sinusoidal amplitudes by an all-pole model. The all-pole model approximates the sinusoidal spectrum well, both in the amplitude and in the phase domain. We use the sinusoidal + all-pole model to control the spectral envelope in recorded speech. High-quality modified speech is generated from the model using sinusoidal synthesis. A perceptual test was conducted, which shows that the model was effective at changing vowel identities and was preferable over residual-excited LPC.
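To make the idea of an all-pole model of sinusoidal amplitudes concrete, here is a minimal sketch in Python. It is not the authors' estimation procedure, which the abstract does not spell out. It assumes the classic autocorrelation fit to a discrete power spectrum, and the function names `fit_all_pole` and `all_pole_response`, the model order, and the synthetic test envelope are all hypothetical choices made for illustration.

```python
import numpy as np

def fit_all_pole(amps, freqs, order):
    """Fit an all-pole envelope to sinusoid amplitudes (assumed method:
    autocorrelation fit to the discrete power spectrum).
    freqs are in radians per sample, strictly between 0 and pi."""
    power = np.asarray(amps, dtype=float) ** 2
    freqs = np.asarray(freqs, dtype=float)
    lags = np.arange(order + 1)
    # Autocorrelation implied by the sampled power spectrum
    r = (power[None, :] * np.cos(lags[:, None] * freqs[None, :])).sum(axis=1)
    # Toeplitz normal equations: R a = -r[1:]
    R = r[np.abs(lags[:order, None] - lags[None, :order])]
    a = np.concatenate(([1.0], np.linalg.solve(R, -r[1:])))
    # Gain chosen so the mean ratio of measured to model power is 1
    err = r[0] + a[1:] @ r[1:]
    gain = np.sqrt(err / len(power))
    return a, gain

def all_pole_response(a, gain, freqs):
    """Complex response gain / A(e^{jw}); gives both amplitude and phase."""
    k = np.arange(len(a))
    return gain / (np.exp(-1j * np.outer(freqs, k)) @ a)

# Synthetic example: harmonics of a 100 Hz voice sampled at 8 kHz,
# with amplitudes drawn from a known two-pole resonance (hypothetical data).
fs, f0 = 8000.0, 100.0
freqs = 2 * np.pi * f0 * np.arange(1, 40) / fs
true_a = np.poly([0.95 * np.exp(0.6j), 0.95 * np.exp(-0.6j)]).real
amps = np.abs(all_pole_response(true_a, 1.0, freqs))

a, gain = fit_all_pole(amps, freqs, order=6)
H = all_pole_response(a, gain, freqs)  # envelope sampled at the harmonics
```

Once the envelope is held as a small set of filter coefficients, it can be altered (for example by moving pole locations) and resampled at the sinusoid frequencies to obtain new amplitudes and phases for sinusoidal synthesis, which is the kind of control the abstract describes.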


Related resources

Source-filter models for time-scale pitch-scale modification of speech

This paper presents two time-scale pitch-scale modification techniques to be used in speech synthesis systems. They have been applied to Microsoft’s Whistler system, which is based on concatenative synthesis. Both methods are based on a source-filter model, one of them using LPC parameters and the other one using cepstral parameters. The proposed methods achieve high quality prosody modification...


Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling

In this paper we present a method for speech modeling and its utilization in IBM’s small footprint concatenative text-to-speech system. The method is based on frequency-domain, complex spectral envelope modeling, where the phase component plays a crucial role in attaining high quality speech synthesis. The modeling scheme presented enables low bit rate compression of the amplitude and phase info...


Estimation of Spectral Mismatch for Joint Cost Evaluation in Marathi TTS

Among different methods of speech synthesis, concatenative speech synthesis is widely used owing to its naturalness and low signal-processing requirements. However, concatenative TTS has problems such as the need for a large database and the resulting spectral mismatch in the output speech. In concatenative TTS, the position of the syllable plays a very important role in segmentation. If proper position syl...


Generating emotional speech with a concatenative synthesizer

We describe the attempt to synthesize emotional speech with a concatenative speech synthesizer using a parameter space covering not only f0, duration and amplitude, but also voice quality parameters, spectral energy distribution, harmonics-to-noise ratio, and articulatory precision. The application of this extended parameter set offers the possibility to combine the high segmental quality of c...


Residual-based speech modification algorithms for text-to-speech synthesis

This paper presents a set of novel algorithms for the signal modification component of concatenative text-to-speech systems. The algorithms described here are based around the LPC analysis/synthesis framework, and achieve prosodic modification by time-domain processing of the LPC residual. The modified residual is then recombined with the all-pole spectral estimate to synthesise the new speech ...


Publication date: 2000
